ABSTRACT

The definition of the Drazin inverse of a square matrix with complex
elements is extended to rectangular matrices by showing that for any B and
W, m by n and n by m, respectively, there exists a unique matrix X such that
$(BW)^k = (BW)^{k+1} XW$ for some positive integer k, $XWBWX = X$, and
$BWX = XWB$. Various expressions satisfied by B, W, X and related matrices
are developed.

SIGNIFICANCE AND EXPLANATION

Efficient methods for handling systems of linear simultaneous algebraic
equations provide a fundamental tool in solving problems in almost every
area of computing and applied mathematics. In matrix notation, the solution
of $Ax = b$ is given by $x = A^{-1} b$, where $A^{-1}$ denotes the inverse
of A.

In classical matrix algebra, the inverse of A exists only if A is square
and "nonsingular." The unique inverse then satisfies
$A A^{-1} = A^{-1} A = I$. A singular or rectangular matrix has no inverse
in this sense. However, there may be associated with it a variety of
"generalized inverses," each having some of the properties of the usual
inverse. For example, suppose that we have m equations in n unknowns,
$Ax = b$, where A is m by n, m > n, with rank n. The solution in a
least-squares sense is

    $x = (A^T A)^{-1} A^T b$

and the matrix multiplying b is a generalized inverse.
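
As a small numerical illustration of the least-squares formula above, the
following sketch (not part of the original report) evaluates
$x = (A^T A)^{-1} A^T b$ exactly for a 3 by 2 matrix of rank 2. The matrix A,
the vector b, and the helper name least_squares_2col are hypothetical choices
made only for this example.

```python
from fractions import Fraction

def least_squares_2col(A, b):
    """Solve min ||Ax - b|| for an m x 2 integer matrix A of rank 2,
    using x = (A^T A)^{-1} A^T b with exact rational arithmetic."""
    # Entries of the symmetric 2 x 2 normal matrix A^T A.
    g11 = sum(r[0] * r[0] for r in A)
    g12 = sum(r[0] * r[1] for r in A)
    g22 = sum(r[1] * r[1] for r in A)
    # Right-hand side A^T b.
    c1 = sum(r[0] * bi for r, bi in zip(A, b))
    c2 = sum(r[1] * bi for r, bi in zip(A, b))
    det = g11 * g22 - g12 * g12  # nonzero because A has rank 2
    # Apply the explicit inverse of the 2 x 2 normal matrix.
    return [Fraction(g22 * c1 - g12 * c2, det),
            Fraction(g11 * c2 - g12 * c1, det)]

# Hypothetical data: three equations in two unknowns.
A = [[1, 0], [1, 1], [1, 2]]
b = [1, 2, 4]
x = least_squares_2col(A, b)
```

For this data x is (5/6, 3/2), and the residual b - Ax is orthogonal to both
columns of A, as the normal equations require.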

A certain type of generalized inverse, the Drazin inverse, has heretofore
been defined only for square (usually singular) matrices and has found
application to Markov processes and to the solution of systems of ordinary
differential equations. In this paper the definition of the Drazin inverse
is extended to rectangular matrices and its properties are studied. (This
extended Drazin inverse is defined in the Abstract.)


The responsibility for the wording and views expressed in this descriptive
summary lies with MRC, and not with the authors of this report.


A DRAZIN INVERSE FOR RECTANGULAR MATRICES


R. E. Cline and T. N. E. Greville

1.  INTRODUCTION.

Let A be any rectangular matrix with complex elements. Then the
Moore-Penrose inverse of A is the unique matrix $X = A^+$ such that

(1.1)  $AXA = A$,  $XAX = X$,  $(AX)^H = AX$,  $(XA)^H = XA$

(where the superscript H denotes conjugate transpose). On the other hand, if
A is square, the Drazin inverse of A is the unique matrix $X = A_d$ such
that

(1.2)  $A^k = A^{k+1} X$, for some positive integer k,

(1.3)  $X = X^2 A$,

(1.4)  $AX = XA$.

It is the purpose of this paper to show that a Drazin inverse can be defined
for rectangular matrices in such a way that both $A^+$ and $A_d$ (when A is
square) follow as special cases.
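
The defining conditions (1.2)-(1.4) can be checked mechanically for a given
candidate. The sketch below is an illustration added here, not part of the
original paper: it tests a proposed X against (1.3) and (1.4) and then
searches for the smallest k satisfying (1.2). The 3 by 3 matrix A, the
candidate X, and the function name drazin_index_witness are assumptions made
for the example.

```python
from fractions import Fraction

def matmul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

def matpow(A, k):
    """A^k for a square matrix A and an integer k >= 0."""
    n = len(A)
    R = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for _ in range(k):
        R = matmul(R, A)
    return R

def drazin_index_witness(A, X):
    """Return the smallest positive k with A^k = A^{k+1} X when X also
    satisfies (1.3) and (1.4); return None if X fails any condition."""
    if matmul(matmul(X, X), A) != X:       # (1.3)  X = X^2 A
        return None
    if matmul(A, X) != matmul(X, A):       # (1.4)  AX = XA
        return None
    for k in range(1, len(A) + 1):         # the index never exceeds the order
        if matpow(A, k) == matmul(matpow(A, k + 1), X):   # (1.2)
            return k
    return None

# Hypothetical example: an invertible 1 x 1 block plus a nilpotent 2 x 2 block.
A = [[2, 0, 0], [0, 0, 1], [0, 0, 0]]
X = [[Fraction(1, 2), 0, 0], [0, 0, 0], [0, 0, 0]]
k = drazin_index_witness(A, X)
```

In this example (1.2) fails for k = 1 because of the nilpotent block, and
first holds for k = 2.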

2.  THE W-WEIGHTED DRAZIN INVERSE OF B.

Although the Drazin inverse was originally considered for elements in an
associative ring [3] and Lemma 1 was established in that context [2], we use
this result only for matrices and restate it accordingly.

LEMMA 1:  For any matrices B and W, m by n and n by m, respectively,

    $(BW)_d = B (WB)_d^2 W$.

The reader can also verify Lemma 1 by taking X equal to the right member of
the equation and A = BW and verifying that (1.2)-(1.4) are satisfied.
Using (1.4) to rewrite (1.3) as $A_d = A_d A A_d$, the expressions in
Corollary 1.1 now follow at once by induction.

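
The verification of Lemma 1 suggested above can be carried out numerically.
In the following sketch (an added illustration with hypothetical B and W, not
taken from the paper) WB is a nonzero 1 by 1 matrix, so its Drazin inverse is
simply its reciprocal, and the right member of Lemma 1 is tested against
(1.2)-(1.4) with A = BW.

```python
from fractions import Fraction

def matmul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

# Hypothetical data: B is 2 x 1 and W is 1 x 2.
B = [[1], [2]]
W = [[3, 1]]
WB = matmul(W, B)                      # the 1 x 1 matrix [[5]]
# The Drazin inverse of a nonzero 1 x 1 matrix is its reciprocal.
WB_d = [[Fraction(1, WB[0][0])]]

# Right member of Lemma 1: B (WB)_d^2 W.
X = matmul(matmul(B, matmul(WB_d, WB_d)), W)

A = matmul(B, W)
cond_12 = A == matmul(matmul(A, A), X)        # (1.2) with k = 1
cond_13 = X == matmul(matmul(X, X), A)        # (1.3)
cond_14 = matmul(A, X) == matmul(X, A)        # (1.4)
```

All three conditions hold here, so X equals $(BW)_d$ for this choice of B
and W.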
COROLLARY 1.1:  For every positive integer p,

    $W (BW)_d^p = (WB)_d^p W$

and

    $B (WB)_d^p = (BW)_d^p B$.

Our first result, Theorem 2, is established for an arbitrary positive
integer p. As will be indicated following the proof, however, the general
case can always be reduced to p = 2 by a simple transformation.

THEOREM 2:  For any matrices B and W, and for every positive integer p,
there is a unique X such that

(2.1)  $(BW)_d XW = (BW)_d^p$,

(2.2)  $BWX = XWB$,

(2.3)  $BW (BW)_d X = X$.

Also, there is a unique X such that

(2.4)  $XW = BW (BW)_d^p$,

(2.5)  $WX = WB (WB)_d^p$,

(2.6)  $XW (BW)^{p-1} X = X$.

The unique matrix X which satisfies both sets of equations is

(2.7)  $X = B (WB)_d^p$.

Proof:  Using (1.3) and Corollary 1.1, it is easily seen that X in (2.7)
satisfies (2.1) to (2.6).

To establish uniqueness, we show first that (2.1), (2.2) and (2.3) imply
(2.4), (2.5) and (2.6), and then that (2.4), (2.5) and (2.6) imply (2.7).
Now $XW = BW (BW)_d XW = BW (BW)_d^p$, by (2.3) and (2.1). Thus (2.4) holds,
and combines with (2.3) to give

    $XW (BW)^{p-1} X = BW (BW)_d^p (BW)^{p-1} X = BW (BW)_d X = X$.

Thus (2.6) also holds. Finally, using (2.2), (2.3), (2.4) and Corollary 1.1
we have

    $WX = WBW (BW)_d X = (WB)_d WBWX$
       $= (WB)_d WXWB = (WB)_d WBW (BW)_d^p B$
       $= (WB)^2 (WB)_d^{p+1} = (WB) (WB)_d^p$,

that is, (2.5). Hence (2.1), (2.2) and (2.3) imply (2.4), (2.5) and (2.6).

If (2.4), (2.5) and (2.6) hold, then

    $X = XW (BW)^{p-1} X = BW (BW)_d^p (BW)^{p-1} X$
      $= (BW)_d BWX = (BW)_d BWB (WB)_d^p$
      $= BWB (WB)_d^{p+1} = B (WB)_d^p$.

Hence (2.7) holds. ■
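
The six relations of Theorem 2 can be confirmed on a concrete case. The
sketch below is an added illustration, not part of the paper: it picks
hypothetical B (3 by 1) and W (1 by 3) with WB a nonzero scalar, builds
$(BW)_d$ from Lemma 1, forms X from (2.7), and tests (2.1)-(2.6) for
p = 1, 2, 3. The function name check_theorem_2 is an assumption of the
example.

```python
from fractions import Fraction

def matmul(*Ms):
    """Product of one or more matrices stored as lists of rows."""
    R = Ms[0]
    for Q in Ms[1:]:
        R = [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
             for row in R]
    return R

def power(A, k):
    """A^k for a square matrix A and an integer k >= 0."""
    n = len(A)
    R = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for _ in range(k):
        R = matmul(R, A)
    return R

# Hypothetical data with WB = [[2]], so (WB)_d = [[1/2]].
B = [[2], [0], [1]]
W = [[1, 1, 0]]
WB_d = [[Fraction(1, matmul(W, B)[0][0])]]
BW = matmul(B, W)
BW_d = matmul(B, power(WB_d, 2), W)            # Lemma 1

def check_theorem_2(p):
    """Test (2.1)-(2.6) for X = B (WB)_d^p, the matrix in (2.7)."""
    X = matmul(B, power(WB_d, p))
    return all([
        matmul(BW_d, X, W) == power(BW_d, p),             # (2.1)
        matmul(BW, X) == matmul(X, W, B),                 # (2.2)
        matmul(BW, BW_d, X) == X,                         # (2.3)
        matmul(X, W) == matmul(BW, power(BW_d, p)),       # (2.4)
        matmul(W, X) == matmul(W, B, power(WB_d, p)),     # (2.5)
        matmul(X, W, power(BW, p - 1), X) == X,           # (2.6)
    ])

results = {p: check_theorem_2(p) for p in (1, 2, 3)}
```

Each entry of results is True for this data. Such a check only confirms that
X in (2.7) is a solution; uniqueness rests on the argument in the proof.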

Observe next that if $p \geq 1$, $q \geq -1$ and $r \geq 0$ are integers
such that q + 2r + 2 = p, and if we let $(WB)^q = (WB)_d$ when q = -1, then
$B (WB)_d^p = B (WB)^q [((WB)^r W)(B (WB)^q)]_d^2$. Consequently,
considerations of X in Theorem 2 can always be reduced to the case p = 2 if
B is replaced by $B (WB)^q$ and W is replaced by $(WB)^r W$.

COROLLARY 2.1:  The matrix $X = B (WB)_d^2$ is the unique solution to the
equations

(2.8)  $(BW)^k = (BW)^{k+1} XW$, for some positive integer k,

(2.9)  $X = XWBWX$,

(2.10)  $BWX = XWB$.


Proof:  That $X = B (WB)_d^2$ is a solution is apparent by noting that the
relations in (2.10) and (2.2) are identical, (2.9) is (2.6) when p = 2 and,
with $XW = (BW)_d$ by (2.4), (2.8) is (1.2) for A = BW.

To show uniqueness, suppose both $X_1$ and $X_2$ are solutions to (2.8), for
some positive integers $k_1$ and $k_2$, respectively, and to (2.9) and
(2.10). Let $k = \max(k_1, k_2)$. Then using repeated applications of
equations (2.8), (2.9), and (2.10) we have

    $X_1 = X_1 WBW X_1 = BW X_1 W X_1 = (BW)^2 (X_1 W)^2 X_1$
        $= \dots = (BW)^k (X_1 W)^k X_1 = (BW)^{k+1} X_2 W (X_1 W)^k X_1$
        $= X_2 (WB)^{k+1} W (X_1 W)^k X_1 = X_2 WBW (BW)^k (X_1 W)^k X_1$
        $= X_2 WBW X_1$.

Continuing in a similar manner, $X_2 WBW X_1$ can be reduced to $X_2$. Thus
$X = B (WB)_d^2$ is unique. ■

It should be noted in Corollary 2.1 that $B_d = B (WB)_d^2$ when B is square
and W = I. Also, there is a direct correspondence between the relations in
(2.8) and (1.2), (2.9) and (1.3) when written as X = XAX, and (2.10) and
(1.4), in which the role of W in (2.8), (2.9) and (2.10) is to act as a
"sandwich" matrix so that products such as BWX and XWB can be defined. In
view of the correspondence between the defining equations for $A_d$ and
those in Corollary 2.1, we define the Drazin inverse of a rectangular matrix
in the following manner:

DEFINITION 1:  For any matrices B and W, m by n and n by m, respectively,
the matrix $X = B (WB)_d^2$ is called the W-weighted Drazin inverse of B,
and is written as $X = B_{d,W}$.
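
To make Definition 1 concrete, the following sketch (an added example with
hypothetical 3 by 2 and 2 by 3 matrices, not from the paper) forms
$B_{d,W} = B (WB)_d^2$ and tests the defining equations (2.8)-(2.10). Here
WB = diag(2, 0), whose Drazin inverse diag(1/2, 0) is entered by hand; that
value is an assumption that can be checked directly against (1.2)-(1.4).

```python
from fractions import Fraction

def matmul(*Ms):
    """Product of one or more matrices stored as lists of rows."""
    R = Ms[0]
    for Q in Ms[1:]:
        R = [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
             for row in R]
    return R

def power(A, k):
    """A^k for a square matrix A and an integer k >= 0."""
    n = len(A)
    R = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for _ in range(k):
        R = matmul(R, A)
    return R

# Hypothetical data: B is 3 x 2, W is 2 x 3, and WB = diag(2, 0).
B = [[1, 0], [0, 1], [0, 0]]
W = [[2, 0, 0], [0, 0, 1]]
WB_d = [[Fraction(1, 2), 0], [0, 0]]     # Drazin inverse of diag(2, 0)

X = matmul(B, WB_d, WB_d)                # B_{d,W} = B (WB)_d^2
BW = matmul(B, W)

def holds_28(k):
    """Equation (2.8) for a given k."""
    return power(BW, k) == matmul(power(BW, k + 1), X, W)

k_min = next(k for k in range(1, 4) if holds_28(k))   # smallest k in (2.8)
cond_29 = X == matmul(X, W, B, W, X)                  # (2.9)
cond_210 = matmul(B, W, X) == matmul(X, W, B)         # (2.10)
```

For this data (2.9) and (2.10) hold, and (2.8) first holds at k = 2 because
BW contains a nilpotent block of index 2.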

Given a square matrix A, the smallest positive integer k such that (1.2)
holds is called the index of A, and it can be shown that k is the minimal
power for which $A^k$ and $A^{k+1}$ have the same rank. Moreover,
$(A_d)_d = A^2 A_d$, so that $(A_d)_d = A$ if and only if A has index one,
whereas $((A_d)_d)_d = A_d$ always has index one [3]. In Theorem 3 we
characterize matrices X such that $X = B_{d,W}$ for some W in terms of
matrices with index one.

THEOREM 3:  Let B and X be any m by n matrices. Then $X = B_{d,W}$ for some
W if and only if X has the form

(2.11)  $X = BYBYB$



3.  THE B-WEIGHTED DRAZIN INVERSE OF W.

For any B and W, m by n and n by m, respectively, interchanging the roles of
B and W throughout §2 provides a completely dual set of results for
$W_{d,B} = W (BW)_d^2$, the B-weighted Drazin inverse of W. However, whereas
$B_{d,W}$ has the size of B, $W_{d,B}$ has the size of $B^H$ and $B^+$.
Analogous to the observation in §2 that $B_d = B_{d,W}$ when B is square and
W = I, we now consider choices of W for which $B^+ = W_{d,B}$.


Following the notation of [1], let $B^{(2)} = \{Z \mid ZBZ = Z\}$. Then
$W_{d,B} \in B^{(2)}$ for any W such that $B^+ = W_{d,B}$.

LEMMA 4:  $W_{d,B} \in B^{(2)}$ if and only if $W_{d,B} = W (BW)_d$.

Proof:  If $W_{d,B} \in B^{(2)}$, then $W (BW)_d^3 = W (BW)_d^2$ so that
$(BW)_d$ is idempotent. Conversely, $W (BW)_d \in B^{(2)}$.


An immediate consequence of Lemma 4 is that only expressions of the form
$B^+ = W (BW)_d$ must be considered when determining choices of W. Moreover,
observe that since $A^+ = A_d$ for any Hermitian matrix and
$(BB^H)^+ = (B^H)^+ B^+$ for any B, then

(3.1)  $B^+ = B^+ B B^+ = B^H (B^H)^+ B^+ = B^H (BB^H)_d$.

Therefore, one choice of W is $B^H$. To characterize the class of all
matrices which can be used to form $B^+$ in this manner, it is convenient to
replace W by $C^H$, where C has the size of B.
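
Relation (3.1) can be tried on a small real matrix, for which the conjugate
transpose is the ordinary transpose. In the sketch below (an added
illustration; the rank-one matrix B is a hypothetical choice) the product
$M = BB^H$ satisfies $M^2 = 25M$, so M/625 is proposed as $M_d$ and checked
against (1.2)-(1.4) before $B^H (BB^H)_d$ is tested against the four
Moore-Penrose conditions (1.1).

```python
from fractions import Fraction

def matmul(*Ms):
    """Product of one or more matrices stored as lists of rows."""
    R = Ms[0]
    for Q in Ms[1:]:
        R = [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
             for row in R]
    return R

def transpose(P):
    """Transpose; equals the conjugate transpose for real entries."""
    return [list(col) for col in zip(*P)]

# Hypothetical real, rank-one example.
B = [[1, 2], [2, 4]]
BH = transpose(B)
M = matmul(B, BH)                                  # B B^H

# M^2 = 25 M for this data, so M / 625 is the candidate for M_d.
D = [[Fraction(m, 625) for m in row] for row in M]
is_drazin = (M == matmul(M, M, D)                  # (1.2) with k = 1
             and D == matmul(D, D, M)              # (1.3)
             and matmul(M, D) == matmul(D, M))     # (1.4)

X = matmul(BH, D)                                  # right member of (3.1)
BX, XB = matmul(B, X), matmul(X, B)
is_moore_penrose = (matmul(B, X, B) == B and matmul(X, B, X) == X
                    and transpose(BX) == BX and transpose(XB) == XB)
```

Both flags are True here, so $B^H (BB^H)_d$ reproduces $B^+ = B/25$ for this
matrix.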


DEFINITION 2:  For any m by n matrices B and C, C is said to be alias to B
if $B^+ = C^H (BC^H)_d$.

Given any m by n matrix B with rank r, B = EF is said to be a full rank
factorization if E and $F^H$ have r columns [1]. In this case
$B^+ = F^+ E^+$ with $E^+ = (E^H E)^{-1} E^H$ a left inverse of E and
$F^+ = F^H (FF^H)^{-1}$ a right inverse of F.
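
The formulas for $E^+$, $F^+$ and $B^+$ above can be evaluated exactly for a
small factorization. The following sketch is an added illustration: the rank
2 factors E and F are hypothetical, the entries are real so that H is plain
transposition, and inv2 is a helper written only for this example.

```python
from fractions import Fraction

def matmul(*Ms):
    """Product of one or more matrices stored as lists of rows."""
    R = Ms[0]
    for Q in Ms[1:]:
        R = [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
             for row in R]
    return R

def transpose(P):
    """Transpose; equals the conjugate transpose for real entries."""
    return [list(col) for col in zip(*P)]

def inv2(P):
    """Exact inverse of a nonsingular 2 x 2 matrix."""
    (a, b), (c, d) = P
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical full rank factorization B = EF with r = 2.
E = [[1, 0], [0, 1], [1, 1]]          # 3 x 2, full column rank
F = [[1, 0, 1], [0, 1, 1]]            # 2 x 3, full row rank
B = matmul(E, F)

EH, FH = transpose(E), transpose(F)
E_plus = matmul(inv2(matmul(EH, E)), EH)      # (E^H E)^{-1} E^H
F_plus = matmul(FH, inv2(matmul(F, FH)))      # F^H (F F^H)^{-1}
B_plus = matmul(F_plus, E_plus)               # F^+ E^+

I2 = [[1, 0], [0, 1]]
left_inverse_ok = matmul(E_plus, E) == I2
right_inverse_ok = matmul(F, F_plus) == I2
BX, XB = matmul(B, B_plus), matmul(B_plus, B)
penrose_ok = (matmul(B, B_plus, B) == B
              and matmul(B_plus, B, B_plus) == B_plus
              and transpose(BX) == BX and transpose(XB) == XB)
```

The three flags confirm that $E^+$ is a left inverse of E, that $F^+$ is a
right inverse of F, and that $F^+ E^+$ satisfies the conditions (1.1) for B.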


Matrices alias to B are now characterized using a full rank factorization of
B. In the proof of Theorem 5 we use the facts [4] that the general solution
of a consistent system of equations AXB = C can be written as

(3.2)  $X = X_0 + Y - A^+ A Y B B^+$,

where $X_0$ is any particular solution and Y is arbitrary, and that
necessary and sufficient conditions for the equations AX = B, XC = D to have
a common solution are that each is consistent and AD = BC, in which case

(3.3)  $X_0 = A^+ B + D C^+ - A^+ A D C^+$

is a solution.


THEOREM 5:  Let B = EF be any full rank factorization. Then C is alias to B
if and only if C has the form

(3.4)  $C = ESF + Z$

where S is any nonsingular matrix and Z is any matrix such that $BZ^H = 0$
and $Z^H B = 0$.

Proof:  If C has the form in (3.4) where S and Z satisfy the hypotheses,
then

    $C^H (BC^H)_d = (C^H B)_d C^H = (F^H S^H E^H E F)_d F^H S^H E^H$
                 $= F^H S^H (E^H E F F^H S^H)_d E^H$
                 $= F^H S^H (S^H)^{-1} (FF^H)^{-1} (E^H E)^{-1} E^H$
                 $= F^+ E^+ = B^+$.

Thus C is alias to B.
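
The sufficiency argument just given can be followed on a small case. In the
sketch below (an added example; E, F, S and Z are hypothetical real matrices
chosen so that $BZ^H = 0$ and $Z^H B = 0$) the matrix $M = BC^H$ has rank
one, so $M^2 = tM$ with t the trace of M, and $M/t^2$ is proposed as $M_d$.
That shortcut is an assumption valid only for such rank-one cases, and the
code checks it against (1.2)-(1.4) before testing Definition 2.

```python
from fractions import Fraction

def matmul(*Ms):
    """Product of one or more matrices stored as lists of rows."""
    R = Ms[0]
    for Q in Ms[1:]:
        R = [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
             for row in R]
    return R

def transpose(P):
    """Transpose; equals the conjugate transpose for real entries."""
    return [list(col) for col in zip(*P)]

def add(P, Q):
    """Entrywise sum of two matrices of the same size."""
    return [[p + q for p, q in zip(rp, rq)] for rp, rq in zip(P, Q)]

# Hypothetical rank-one data for (3.4): C = ESF + Z.
E, F, S = [[1], [2]], [[1, 2]], [[3]]
Z = [[4, -2], [-2, 1]]
B = matmul(E, F)
zero = [[0, 0], [0, 0]]
hypotheses_ok = (matmul(B, transpose(Z)) == zero
                 and matmul(transpose(Z), B) == zero)

C = add(matmul(E, S, F), Z)
CH = transpose(C)
M = matmul(B, CH)                              # B C^H, rank one here
t = M[0][0] + M[1][1]                          # trace, so that M^2 = t M
D = [[Fraction(m, t * t) for m in row] for row in M]
is_drazin = (M == matmul(M, M, D) and D == matmul(D, D, M)
             and matmul(M, D) == matmul(D, M))

# B^+ = F^+ E^+; both Gram matrices are 1 x 1, so inverses are reciprocals.
e = matmul(transpose(E), E)[0][0]
f = matmul(F, transpose(F))[0][0]
B_plus = [[Fraction(x, e * f) for x in row]
          for row in matmul(transpose(F), transpose(E))]

is_alias = matmul(CH, D) == B_plus             # Definition 2
```

Here C is [[7, 4], [4, 13]], which is not a multiple of B, yet
$C^H (BC^H)_d$ still equals $B^+ = B/25$.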

To prove the converse, suppose first that B has either full row rank or full
column rank. If B has full row rank and C is alias to B, then
$B^+ = B^H (BB^H)^{-1} = C^H (BC^H)_d$ implies that $BC^H$ is nonsingular.
Hence $C = S_1 B$ where $S_1 = C B^H (BB^H)^{-1} = C B^+$ is nonsingular. By
a similar type of argument, $C = B S_2$ with $S_2$ nonsingular if B has full
column rank. Consequently, (3.4) holds with Z = 0 in both cases.

Suppose now that B has neither full row rank nor full column rank. With
B = EF a full rank factorization and C alias to B, then
$F^+ E^+ = C^H (BC^H)_d$ implies

    $E^+ = F C^H (EFC^H)_d$

and

    $F^+ = C^H (EFC^H)_d E = C^H E (FC^H E)_d$.

Therefore, $CF^H$ is alias to E and $E^H C$ is alias to F, so that there
exist nonsingular matrices Q and R, say, for which

(3.5)  $FC^H = QE^H$

and

(3.6)  $C^H E = F^H R$.


Using (3.3) it then follows that

(3.7)  $X_0 = F^+ Q E^H + F^H R E^+ - F^+ F F^H R E^+ = F^+ Q E^H$

is a solution to (3.5), (3.6) and thus to

(3.8)  $F C^H E = Q E^H E$.

But by (3.2), all solutions of (3.8) can be written as

(3.9)  $C^H = X_0 + Z^H$

where $Z^H = Y - F^+ F Y E E^+$ with Y arbitrary. Consequently those
solutions in (3.9) such that (3.5) and (3.6) also hold must satisfy
$FZ^H = 0$ and $Z^H E = 0$, which implies $BZ^H = 0$ and $Z^H B = 0$.
Finally, the form in (3.4) is obtained from (3.9) by noting that the last
expression for $X_0$ in (3.7) can be written as $X_0 = F^H S^H E^H$ with
$S^H = (FF^H)^{-1} Q$ nonsingular. ■

In Corollary 5.1 we characterize those matrices in (3.4) with Z = 0.

COROLLARY 5.1:  For any full rank factorization B = EF, the set of matrices
alias to B of the form C = ESF is an equivalence class.

Proof:  The relation is reflexive with S = I, that is, (3.1). Since C = E(SF)